Simple Reliable Multicast for Parallel Processing in Extended LANs
Abstract
A typical problem for a parallel processing system involves broadcasting large amounts of data from a master to several worker programs. This paper describes a reliable multicast method to reduce the communication costs of this distribution. Our solution relies on a tuple-space mechanism as implemented in the Synergy system. We present results showing that even a simple implementation of reliable multicast can dramatically improve performance.

1. Background: The Synergy system

The tuple space model was introduced in the LINDA [2] system. Synergy [4] is a tuple-space based SIMD parallel processing system that runs on a network of workstations. From a programmer’s point of view, Synergy consists of three types of entities: exactly one master, one or more workers, and one or more tuple spaces. The master and workers are user programs, while tuple spaces are object repositories that the master and workers write tuples to and read tuples from. A tuple is a named data object. Once put into a tuple space, a tuple persists until it is removed. Another example of a tuple-space based system is LIMBO [1], which uses reliable multicast in conjunction with tuple spaces to provide Quality of Service management in mobile-aware distributed applications.

A parallel solution to a problem is implemented in Synergy by having the master divide the problem space into sub-problems and then put one or more tuples for each sub-problem into tuple space. The workers then obtain tuples from the tuple space, compute the (partial) solution, and return the solution to the same or another tuple space. The master, after reading all the solution tuples, generates the complete solution and returns the result. Currently, workers use Synergy system calls to read tuples from the tuple space over a network. Prior to this work, Synergy system calls used only unicast communication.

2. Multicasting

In parallel computing, the total solution cost is the sum of the computation and communication costs.
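
As a rough illustration of why the communication term matters, the cost decomposition can be written out under simplifying assumptions that are not part of the original text: $N$ workers, an identical payload of $S$ bytes for every worker, and an effective network bandwidth $B$, ignoring latency, contention, and retransmissions.

```latex
T_{\text{total}} = T_{\text{comp}} + T_{\text{comm}}, \qquad
T_{\text{comm}}^{\text{unicast}} \approx \frac{N\,S}{B}, \qquad
T_{\text{comm}}^{\text{multicast}} \approx \frac{S}{B}
```

Under these idealized assumptions the unicast distribution cost grows linearly with the number of workers, while a single multicast transmission does not. Actual gains depend on loss rates and on the network itself.
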
For many problems the total cost is dominated by the initial problem distribution cost, i.e. the time to distribute the sub-problem and relevant data to the workers. In cases where a large proportion of the sub-problem data to be distributed is identical, multicasting can improve performance.

IP multicasting is inherently unreliable. In unicasting, reliability is achieved by having the sender retransmit. However, unicast reliability techniques do not scale to multicast because of the well-known ack-implosion problem. Many sophisticated reliable multicast algorithms have been investigated [3]. In this paper, we show that significant gains can be obtained even with a minimal use of multicast. Specifically, we have exploited two features of Synergy:

(1) All data to be distributed is present in the tuple space, and the tuple space is persistent.
(2) Workers normally go to the tuple space to get their sub-problems.

We shift the responsibility for reliability from the sender to the receiver. The idea can be summarized as:

MASTER:
(1) Generate the data and place it in tuple space.
(2) Wait for acks from workers indicating readiness.
(3) Multicast the data.

WORKER:
(1) Send an ack to the master indicating that the worker is ready.
(2) Receive data via multicast.
(3) If the next expected tuple is not received within an (application-specific) interval, retrieve the lost data from tuple space using regular Synergy functions.

3. Experiment Design and Analysis of Results

The experiment involved distributing an integer matrix to processors, as in a parallel implementation of matrix multiplication. We calculated the average delay over ten repetitions of the experiment for each combination of
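
The MASTER and WORKER steps summarized in Section 2 can be illustrated with a small in-memory simulation. This is a minimal sketch, not the Synergy implementation: `TupleSpace`, `Worker`, `distribute`, and `run_demo` are hypothetical names, the multicast channel is modeled as a seeded random drop per receiver instead of real IP multicast, and the application-specific timeout is modeled as a check performed after the multicast pass.

```python
import random


class TupleSpace:
    """Hypothetical in-memory stand-in for a persistent tuple space."""

    def __init__(self):
        self._tuples = {}

    def put(self, name, value):
        self._tuples[name] = value

    def read(self, name):
        # Non-destructive read: the tuple stays available to other workers.
        return self._tuples[name]


class Worker:
    def __init__(self, worker_id, space):
        self.worker_id = worker_id
        self.space = space
        self.received = {}
        self.recovered = 0  # tuples fetched from the tuple space after a loss

    def ready_ack(self):
        # WORKER step 1: tell the master this worker is ready.
        return self.worker_id

    def on_multicast(self, name, value):
        # WORKER step 2: accept a tuple delivered by multicast.
        self.received[name] = value

    def recover_missing(self, expected_names):
        # WORKER step 3 (assumption: the timeout is modeled as a final check).
        # Anything not received is read from the tuple space instead.
        for name in expected_names:
            if name not in self.received:
                self.received[name] = self.space.read(name)
                self.recovered += 1


def distribute(data, workers, space, loss_rate, rng):
    # MASTER step 1: place all data in the tuple space.
    for name, value in data.items():
        space.put(name, value)
    # MASTER step 2: wait until every worker has acknowledged readiness.
    acks = {w.ready_ack() for w in workers}
    if len(acks) != len(workers):
        raise RuntimeError("not all workers are ready")
    # MASTER step 3: one logical send per tuple; each receiver may lose it.
    for name, value in data.items():
        for w in workers:
            if rng.random() >= loss_rate:
                w.on_multicast(name, value)
    # Receiver-driven recovery; the master never retransmits.
    for w in workers:
        w.recover_missing(list(data))


def run_demo(loss_rate=0.2, n_workers=4, n_rows=8, seed=0):
    rng = random.Random(seed)
    space = TupleSpace()
    # An integer matrix, one named tuple per row.
    data = {f"row{i}": [i * n_rows + j for j in range(n_rows)]
            for i in range(n_rows)}
    workers = [Worker(i, space) for i in range(n_workers)]
    distribute(data, workers, space, loss_rate, rng)
    return data, workers


if __name__ == "__main__":
    data, workers = run_demo()
    for w in workers:
        print(w.worker_id, "complete:", w.received == data,
              "recovered:", w.recovered)
```

In this sketch the master never tracks per-worker delivery state, so there are no per-packet acknowledgements to cause ack implosion. Each worker ends up with the full matrix because the tuple space still holds every tuple it failed to receive.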